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Abstract 

Consider the problem in which n jobs that are classified into k types are to 
be scheduled on m identical machines without preemption. A machine requires 
a proper setup taking s time units before processing jobs of a given type. The 
objective is to minimize the makespan of the resulting schedule. We design and 
analyze an approximation algorithm that tuns in time polynomial in n, m and k 
and computes a solution with an approximation factor that can be made arbitrarily 
close to 3/2. 


1 Introduction 

In this paper, we consider a scheduling problem where a set of n jobs, each with an 
individual processing time, that is partitioned into k disjoint classes has to be scheduled 
on m identical machines. Before a machine is ready to process jobs belonging to a 
certain class, this machine has to be configured properly. That is, whenever a machine 
switches from processing a job of one class to a job of another class, a setup taking 
s time units is required. Meanwhile a machine is not available for processing. The 
objective is to assign jobs (and the respective setup operations) to machines so as to 
minimize the makespan of the resulting non-preemptive schedule. 

The considered problem models situations where the preparation of machines for 
processing jobs requires a non-negligible setup time. These setups depend on the 
classes of jobs to be processed (i.e. they are class-dependent), however, the required 
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setup time is class-independet. Also, jobs might not be preempted, e.g. because of ad¬ 
ditional high preemption costs. Possible examples of problems for which this model 
is applicable are (1) the processing of jobs on (re-)configurable machines (e.g. Field 
Programmable Gate Arrays) which only provide functionalities required for certain 
operations (or jobs of a certain class) after a suitable setup or (2) a scenario where 
large tasks (consisting of smaller jobs) have to be scheduled on remote machines and it 
takes a certain (setup) time to make task-dependent data available on these distributed 
machines. 

Surprisingly, although a lot of research has been done on scheduling with setup 
times, we are not aware of results concerning the considered model. This is due to 
the fact that the motivation for considering setup times are often related to preemption 
of jobs, which is not true for our model. We discuss some results on these alternative 
models in the following section on related work. Thereafter, we discuss some prelim¬ 
inaries and two simple algorithms in Section that include a fully polynomial time 
approximation scheme (FPTAS) for the considered problem if the number m of ma¬ 
chines is constant and a greedy strategy yielding 2-approximate solutions. Section]^ 
presents the main contribution of this paper which is an algorithm whose approxima¬ 
tion factor can be made arbitrarily close to 3/2 with a runtime that is polynomial in the 
input quantities n, k and m. Finally, in Section|^we introduce an online version where 
jobs arrive over time and shortly discuss how to turn, employing a known technique, 
our offline algorithm into an online strategy with a competitiveness arbitrary close to 4. 

1.1 Related Work 

The scheduling problem considered in this paper is a generalization of the classical 
problem of scheduling jobs on identical machines without preemption and in which 
setup times are equal to 0. This problem has been extensively studied in theoretical 
research and PTASs with runtimes that are linear in the number n of jobs are known 
for objective functions such as minimizing (maximizing) the maximum (minimum) 
completion time or sum of completion times i2l|5l. If the number m of machines is 
constant, even FPTASs exist 0. 

When setup times are larger than 0, the problem is usually refered to as scheduling 
with setup times (or setup costs). It has also been studied for quite a long time and there 
is a rich literature analyzing different models and objective functions. Usually models 
are distinguished by whether or not setup times are job-, machine- and/or sequence- 
dependent. For an overview on studied problems and results in this context the reader 
is refered to detailed surveys on scheduling with setup times iiiiia. We discuss some 
closely related problems in the following. In Q, Monma and Potts consider a model 
quite similar to ours but they allow preemption of jobs and setup times may be different 
for each class. They design two simple algorithms, one with an approximation factor 
of at most max{3/2 — 1/(4 to — 4), 5/3 — 1/m} if each class is small (i.e. setup time 
plus size of all jobs of a class are not larger than the optimal makespan), and a second 
one with approximation factor of at most 2 — l/([m/2j + 1) for the general case. 
Later, Schuurman and Woeginger a improve the result for the case that each class 
consists of only one job that, together with its setup time, is not larger than the optimal 
makespan. The authors design a PTAS for the case where all setup times are identical 
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and a polynomial time algorithm with approximation factor arbitrary close to 4/3 for 
non-identical setup times. 

A closely related problem was also studied in another context by Shachnai and 
Tamir fH. They design a dual PTAS for a class-constrained packing problem. In 
contrast to the basic bin packing problem, in this variant each item belongs to a class 
and each bin has an upper bound on the number of different classes of which items 
might be placed in one bin. 

The dual problem of our scheduling problem was studied by Xavier and Miyazawa 
and is known as class-constrained shelf bin packing. For a constant number of classes, 
an asymptotic PTAS is known for this problem lfT2l as well as a dual approximation 
scheme da, i.e. a PTAS for our problem if k is constant. 

Very recently, Correa et al. 13 studied the problem of scheduling splittable jobs 
on unrelated machines. Here, unrelated refers to the fact that each job may have a 
different processing time on each of the machines. In their model, jobs may be split 
and each part might be assigned to a different machine but requires a setup before being 
processed. For this problem and the objective of minimizing the makespan they show 
their algorithm to have an approximation factor of at most 1 + (j), where « 1.618 is 
the golden ratio. 

In m, an online variant of scheduling with setup times is considered. The authors 
propose a 0(l)-competitive online algorithm for minimizing the maximum flow time 
if jobs arrive over time at one single machine. 

2 Model & Notation 

We consider a model in which there is a set J = {1,..., n} of n independent jobs (i.e. 
there are no precedence constraints for jobs) that are to be scheduled on m identical 
machines M = {Mi,..., M^}. Each job i is available at the beginning and comes 
with a processing time (or size) pi G N>o. Additionally, the job set is partitioned 
into k disjoint classes C = (Ci,..., Ck}, i.e. J = IJ^^q Ci and Ci fl Cj =0 for 
all i ^ j. Before a job j G Ci can be processed on a machine, this machine has to be 
configured properly and afterwards jobs of class Ci can be processed without additional 
setups until the machine is reconfigured for a class Ci' ^ Ci. That is, a setup needs 
to take place before the first job is processed on a machine and whenever the machine 
switches from processing a job j G Ci to a job j' G Ci' with Ci ^ Ci'. Such a setup 
takes s G N>o time units and while setting up a machine, it is blocked and cannot do 
any processing. 

Given this setting, the objective is to find a feasible schedule that minimizes the 
makespan, i.e. the maximum completion time of a job, and does not preempt any job, 
i.e. once the processing of a job is started at a machine it finishes at this machine 
without interruption. 

In the following we refer to the overall processing time of all jobs of a class Ci as 
its workload and denote it w{Ci) := Pj assume that for all 1 < i < n it 

holds that w{Ci) < 'yOPT for some constant 7 and OPT being the optimal makespan. 
By abuse of notation, by w{Ci) we sometimes also represent (an arbitrary sequence of) 
those jobs belonging to class Ci. To refer to the class Ci of a job j G Ci, we use a 
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mapping c : J ^ C with c(j) = Ci and we say a job j G Ci forms an individual class 
if c~^{Ci) = {j}. The processing time of the largest job in a given instance is denoted 
by Pmax ■= maxi<i<„(pi). We say a machine is an exclusive machine (of a class Ci) 
if it only processes jobs of a single class (class Ci). 


3 Preliminaries & Warm-Up 

As a preliminary for our approximation algorithm presented in Section we need to 
know the optimal makespan before we can actually compute a schedule fulfilling the 
desired approximation guarantee concerning its makespan. However, this assumption 
is feasible and justified by the applicability of a common notion known as a-relaxed 
decision procedure Q. 

Definition 3.1. Given an instance I and a candidate makespan T, an a-relaxed deci¬ 
sion procedure either outputs no or provides a schedule with makespan at most a ■ T. 
In case it outputs no, there is no schedule with makespan at most T. 

Using such an a-relaxed decision procedure (that runs in polynomial time) to guide 
a binary search on an interval [Z, u] with OPT G [Z, u], we directly obtain a polynomial 
time approximation algorithm with approximation factor a. We can find a suitable 
interval containing the optimal makespan by applying a greedy algorithm that provides 
an interval of length OPT as follows. 

Lemma 3.2. There is a greedy algorithm with runtime 0(n} and approximation factor 
at most 2. 

( Aj s I ^ ^ ^ 

S + Pmax ) - 

bound on OPT. Now, consider the sequence 

w{Ci), s, w{C 2 ),s,w{Ck). 

Note that the length of this sequence is exactly {k — l)s + < rnT. Thus, if 

we split it at points IT,1 gN into blocks of length T, we obtain at most m blocks. We 
now transform each of these blocks in such a way that we obtain a feasible schedule for 
all jobs on m machines. To do so, we need to add at most one setup at time 0 on each 
machine. In case a job is split, we also add the remaining processing time of this job to 
the machine it started on and remove it from the machine where it should have finished. 
Hence, we obtain a valid schedule S with makespan S < T + s + Pmax — 1 < 2T 
which yields T < OPT <S<2T. □ 

For the sake of simplicity, we assume in the following that by means of this ap¬ 
proach we have guessed OPT correctly and show how to obtain an effective approxi¬ 
mation algorithm. Particularly, using the presented algorithm within the binary search 
framework as an a-relaxed decision procedure, provides the final result. 


j gives a trivial lower 
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3.1 Constant Number of Machines 

As a first simple result we show that the problem is rather easy to solve if the number 
m of machines is upper bounded by a constant. For this case we show how to obtain 
an FPTAS, i.e. an approximation algorithm that, given any £ > 0, computes a solution 
with approximation factor at most 1 + £ and runs in time polynomial in n, k and i. 
First of all, note that it is simple to enumerate all possible schedules. To do so, sort the 
set of jobs according to classes. Let Si,0 < i < n, be the set of all possible (partial) 
schedules for the first i jobs. Let 5o = 0 and ji,... ,jk, be the indices i at which 
there is a change from a job of one class to one of another in the ordered sequence and 
ji := 1. To compute Si, if i ^ ji, ... jk, consider each schedule in and for each 
possible assignment of job f to a machine for which a setup took place for f’s class c{i) 
put the corresponding schedule into Si (if the makespan is not larger than T, others can 
be directly discarded). If f = ji for some 1 < I < k, first compute all 2^ — 1 possible 
extensions of schedules in Si-i by setups for z’s class c(z) and then proceed as in the 
case before. Obviously, choosing a schedule S G Sn with minimum makespan yields 
an optimal solution. 

In order to obtain an efficient algorithm from this straightforward enumeration of 
all possible schedules, we first define some dominance relation that helps to remove 
schedules during the enumeration process for which there are other schedules that will 
be at least as good for the overall instance. 

Definition 3.3. After computing Si, a schedule S G Si is dominated by S' G Si if 

• S and S' have the same makespan on the first m—1 machines and the makespan 
of S' on the m-th machine is at most as large and 

• in case that c(i) = c(z + 1), in S and S' the same machines are set up for i’s 
class c{i). 

Note that by removing dominated schedules directly after the computation of Si 
and before the computation of we may reduce the size of Si without influencing 
the best obtainable makespan computed at the end in However, we cannot ensure 
that the Sfs have a small size. Thus, we consider the following rounding, which is 
applied before the enumeration: Round up s and the size pj of each job j to the next 
integer multiple of e^/(n+fc), where £ > 0 defines the desired precision of the FPTAS. 
As to any machine we assign at most n jobs and k setups, the rounding may introduce 
an additive error of at most e ■ T < e ■ OPT. Additionally, the rounding helps to 
make sure that each Si is not too large after removing dominated schedules. Due to our 
dominance definition, there are at most ("+fc)/£ different makespans that may occur in 
schedules in Si. Hence, there are at most 2™ • (n+k/s)"^ many schedules in Si that are 
not dominated, thus proving the following theorem. 

Theorem 3.4. If the number m of machines is bounded by a constant, there is an 
FPTAS with runtime Oinje + rz log n). 
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4 A (3/2 + £)-Approximation Algorithm 

In this section, we present the main algorithm of the paper. The outline of our approach 
is as follows: 

(1) We first identify a class of schedules that features a certain structural property and 
show that if we narrow our search for a solution to schedules belonging to this class, 
we will still find a good schedule, i.e. one whose makespan is not too far away from an 
optimal one. 

(2) We then show how to perform a rounding of the involved job sizes and further 
transformations and thereby significantly decrease the size of the search space. 

(3) Finally, given such a (transformed) instance, it will be easy to optimize over the 
restricted class of schedules studied in (1) to obtain an approximate solution to any 
given instance. 

4.1 Block-Schedules 

We start by discussing the question how to narrow our study to a class of schedules that 
fulfill a certain property and still, be able to find a provably good approximate solution. 
Particularly, we focus on block-schedules, which are schedules satisfying a structural 
property, and which we define as follows. 

Definition 4.1. Given an instance I, we call a schedule for I block-schedule if for all 
1 < i < m the following holds: In the (partial) schedule for the machines Mi ,..., Mi, 
there is at most one class of which some but not all jobs are processed on Mi,..., Mi. 

Intuitively speaking, in a block-schedule all jobs of a class are executed in a block in 
the sense that they are assigned to consecutive machines and not widely scattered. 

In order to prove our main theorem about block-schedules, we first have to take 
care of jobs having a large processing time in terms of the optimal makespan. Let 
Li = {j € Ci : \OPT — s < pj < \OPT} be the set of Zarge jobs of class Ci and 
Hi = {j € Ci : Pj > ^OPTj be the set of huge jobs of class Q. Based on these 
definitions we show the following lemma. 

Lemma 4.2. With an additive loss of s in the makespan we may assume that 

1. Each huge job forms an individual class, 

2. There is a schedule with the property that all large jobs of class Ci are processed 
on exclusive machines, except (possibly) one large job qi € Li, for each Ci, and 

3. Qi = argminj^j^,{pj} is the smallest large job in Ci and the machine it is pro¬ 
cessed on has makespan at most OPT. 

Proof. We prove the lemma by showing how to establish the three properties by trans¬ 
formations of the given instance / and an optimal schedule S for I with makespan 
OPT. To establish the first property, transform I into T by putting each job j G Hi 
into a new individual class, for each class Ci. Because any machine processing such 
a huge job j cannot process any other huge or large job due to their definitions, the 
transformation increases the makespan of any machine by at most s. 
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Next, we focus on the second property. In S no machine can process two large 
jobs of different classes. Hence, we distinguish the following two cases: A machine 
processes one large job or a machine processes at least two large jobs. We start with the 
latter case and consider any machine that processes at least two large jobs of a class C^. 
Because these two jobs already require at least 2 [(OPT+iy 2 — s] + s > OPT — s + 1 
time units including the setup time, no job of another class can be processed and thus, 
this machine already is an exclusive machine. On the other hand, if a machine Mp only 
processes one large job j G Ci, we can argue as follows. The machine Mp works on j 
for at least [(OfT+iy 2 j time units (including the setup). Thus, the remaining jobs and 
setups processed by Mp can have a size of at most . If there is still another 

machine processing a single large job of Ci, we can exchange these jobs and setups 
with this large job and both involved machines have a makespan of at most OPT + s. 
Also, the machine from which the large job was removed does not contain any huge or 
large jobs anymore ensuring there is no machine where this process can happen twice. 
We can repeat this procedure until all (but possibly one) large jobs are paired so that 
the second property holds since no machine is considered twice. 

Finally, to establish the third property, we can argue as follows: If the smallest large 
job Qi is the only large one on a machine in the schedule S, we can do the grouping 
just described without shifting qi to another machine satisfying the desired bound on 
the makespan. If qi is already processed on a machine together with another large job, 
we may pair the remaining jobs but (possibly) one (one that is not processed together 
with another large job on a machine). In case there is such a remaining unpaired job, 
we hnally exchange qi with the unpaired job. The resulting schedule fulfills the desired 
properties. □ 


We now put the smallest large job qi of each class Ci into a new individual class. 
Based on the previous result, there is still a schedule with makespan at most OPT + s 
for the resulting instance. 

In the next lemma, we directly deduce that there is a block-schedule with makespan 
at most OPT -f s if we allow some jobs to be split, i.e. some jobs are cut into two parts 
that are treated as individual jobs and processed on di ffere nt machines. To this end, fix 
a schedule S for I fulfilling the properties of Lemma 4.2 By M denote the exclusive 


machines according to schedule S and by Ci the class Ci without those jobs processed 
on machines belonging to M. 


Lemma 4.3. Given the schedule S fulfilling the properties of Lemma \4.2\ there is a 
schedule S' with makespan at most OPT + s with the following properties: 

1. A machine is exclusive in S' if and only if it belongs to M and the partial sched¬ 
ule of these machines is unchanged. 

2. When removing the machines belonging to M and their jobs from S, we can 
schedule the remaining jobs on the remaining machines such that 

(a) The block-property holds and 

(b) only jobs with size at most ^OPT — s are split. 
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Proof. Remove machines belonging to M and the jobs scheduled on them from the 
schedule S obtaining S. We now show that there is a schedule S' with the desired 
properties. Similar to 13 consider a graph G — {V,E) m which the nodes corre¬ 
spond to the machines in S and there is an edge between two nodes if and only if 
in S the respective machines process jobs of the same class. We argue for each con¬ 
nected component of G. Let m' be the number of nodes/machines in this compo¬ 
nent. Furthermore, let G' = {Cj,... C/} be the set of classes processed on these 
machines without those formed by single huge or large jobs and H = {hi, ..., hr} 
be the set of jobs processed on these machines that are either huge jobs or large 
jobs forming individual classes. Note that r < m' since all jobs of H must be pro¬ 
cessed on different machines in S. By an averaging argument we know OPT -f s > 

^ (LLi + LLi + il + r + m' - l)s^ and hence. 


I r 

w{C}) + {I — l)s < {m! — r)OPT + '^{OPT — w{hi) — s). (1) 

i=l i=l 

Consider the sequence s, s,..., s, w{C'i) of length + (^ ~ 

l)s and split it from the left to the right into blocks of length OPT — w{hi) — 
s,..., OPT — w{hr) — s, followed by blocks of length OPT. Note that each block 
has non-negative length. By equation (1) we obtain at most m! blocks and by adding a 
setup to each block and the jobs hi plus setup to the first r blocks, we can process each 
block on one machine. 

Consequently, if we apply these arguments to each connected component and add 
the removed exclusive machines again, we have shown that there is a schedule S' with 
makespan at most OPT -f s satisfying the required properties of the lemma. □ 

Lemma [43l proves the existence of a schedule that almost fulfills the properties 
of block-schedules, whose existence is the major concern in this section. However, 
it remains to show how to handle jobs that are split as we do not allow splitting or 
preemption of jobs and how to place exclusive machines belonging to M, which are 
not taken care of by the previous lemma, into the obtained schedule in order to yield a 
block-schedule. 

To simplify the description in the following, when we say we place an exclusive 
machine Mi before machine Mj , we think of a re-indexing of the machines such that 
the ordering of machines other than Mi and Mj stays untouched but now the new 
indices of Mi and Mj are consecutive. Also, a job j is started at the machine that 
processes (parts of) j and has the smallest index among all those processing j. A class 
Ci is processed at the end (beginning) of a machine if there is a job j G Ci that is 
processed as the last job (as the first job) on Mj. 

Lemma 4.4. A schedule fulfilling the properties of Lemma \4.3\ can be transformed into 
a block-schedule with makespan at most ^OPT. 

Proof. Consider an arbitrary class Oi. We distinguish three cases depending on where 
the jobs of Oi are placed in the schedule S' according to the proof of the previous 
lemma. 
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(1) There is a job in Ci that is split among two machines Mj and Mj+i. 

(2) There is no job in Ci that is split. 

(3) Q = 0. 

In case ([^ there is a job in Ci that is split, i.e. one part is processed until the 
completion time of Mj and one from time s on by Mj+i. Hence, we can simply place 
all machines of Ci between Mj and Mjj^i. Since jobs that are split have size at most 
^OPT — s, we can process any split job completely on the machine on which it was 
started increasing its makespan to at most |OPT. We repeat this process as long as 
there are jobs with property 0 left. Note that for each class Ci, after having finished 
case there is no split job left. 

In case 0, we distinguish two cases. If the jobs in Ci have an overall size of at 
most ^OPT (including setup), there either is no exclusive machine of Q and hence 
no violation of the block-property, or we can process the jobs on an exclusive machine 
of Ci increasing its makespan to at most ^OPT. If the jobs have an overall size of 
more than ^OPT, we distinguish whether Ci is processed at the end or beginning of a 
machine Mj or not. In the positive case, we can simply place any exclusive machines 
of Ci behind or before machine Mj. If Ci is not processed at the end or beginning of a 
machine Mj, there must be a second class Cii that is processed at the beginning and a 
third class Ci" that is processed at the end of machine Mj. Note that consequently the 
workload of Ci/ processed on Mj cannot be larger than ^OPT — s. We can perform 
the following steps on the currently considered machine Mj: 

1. Move all jobs from the class Q/ that is processed at the beginning of Mj to machine 
Mj-i if Ci' is also processed at the end of Mj_i, thus only increasing the makespan 
of Mj-i by at most ^OPT — s. 

2. Move all other jobs processed before some workload of Ci to one of their exclusive 
machines, if they exist. 

3. Shift all the workload w{Ci) to time 0 on machine Mj and shift other jobs to a later 
point in time. 

4. Place all exclusive machines of Ci in front of Mj. 

In case ([^, there are only exclusive machines. Such machines can simply be placed 
behind all other machines. 

These steps establish the block-schedule property and no jobs are split anymore. 
Also note that each machine gets an additional workload of at most ^OPT— s without 
requiring additional setups. Thus, the required bound on the makespan holds, proving 
the lemma. □ 

Theorem 4.5. Given an instance I with optimal makespan OPT, there is a transfor¬ 
mation to I' and a block-schedule for I' with makespan at most OPTbl '■= minjOPT -\- 
Pmax ~ 1; I OPT}- It can be turned into a schedule for I with makespan not larger 
than OPTbl- 
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Proof. The bound OPTbl < ^OPT directly follows from Lemma |4.4| and the fact 


that there are only transformations performed on instance I by Lemma 4.2 The sec¬ 
ond bound (which gives a better result if Pmax < \OPT) follows by arguments quite 
similar to those used before: If Pmax ^ ^OPT holds, we skip the transformation of 
Lemma 4.2 Additionally, in the proof of Lemma 4.3 we do not remove exclusive ma¬ 
chines (thus, considering all machines). Note that, since we skipped the transformation 
of Lemma 4.2 the set H is empty. Then, it is straightforward to calculate the second 
bound of OPTbl < OPT + pmax - 1- □ 


4.2 Grouping & Rounding 

In this section, we show how we can reduce the search space by rounding the involved 
processing times to integer multiples of some value depending on the desired precision 
e > 0 of the approximation. We assume that the transformations described in previous 
sections have already been performed. In order to be able to ensure that the rounding 
of processing times cannot increase the makespan of the resulting schedule too much, 
we first need to get rid of classes and jobs that have a very small workload in terms of 
OPTbl and e. In the following, we use A > 0 to represent the desired precision, i.e. 
A essentially depends on the reciprocal of e. We call every job j with pj < OPTbl /A 
a tiny job and every class Ci with w{Ci) < OPTbl I ^ a tiny class. 

Lemma 4.6. Given a block-schedule for an instance I, with an additive loss of at most 
AOPTbl/^ in the makespan we may assume that tiny jobs only occur in tiny classes. 

Proof. We prove the lemma by applying the following transformations to each class 
Cp. In a first step, we greedily group tiny jobs of class Ci to new jobs with sizes in the 
interval [OPTbl/A, 20PTblI^)- In a second step, combine the (possibly) remaining 
tiny grouped job j G Ci with a size less than OPTbl/K with an arbitrary other job 
/ G Ci. By this transformation we ensure that tiny jobs only occur in tiny classes and 
it remains to show the claimed bound on the makespan. 

First of all, focus on the first step of the transformation and assume that we do not 
perform the second step. Let S be the given block-schedule for instance I. Lemma 2.3 
in the work of Shachnai and Tamir ifTOll proves (speaking in our terms) that for the 
transformed instance there is a schedule S' with makespan of at most OPTbl + 
20PTbl/^- The proof also implies that S' is still a block-schedule: For each ma¬ 
chine Mj it holds that if Mj is configured for class Ci in the new schedule S', it has 
also been configured for Ci in the original block-schedule S. Thus, if S' is a block 
schedule, so is S' since we do not have any additional setups in S'. 

Now assume that also the second step of the transformation is carried out and con¬ 
sider the block-schedule S' we just proved to exist. Distinguish two cases, depending 
on where the tiny grouped job j G Ci, which was paired in the second step, is processed 
in schedule S'. If j was paired with a job j' and both j and j' are assigned to the same 
machine in S', the schedule S' already is feasible for the transformed instance (possi¬ 
bly after shifting j and j' such that they are processed consecutively). If the paired jobs 
j and j' are processed on different machines in schedule S', there is a schedule whose 
makespan is by an additive of at most larger than that of S'. To see this, note 

that in S' this case can happen at most twice per machine (for the classes processed at 
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the beginning and end of the machine). Hence, we can place any paired jobs j and / 
on the same machine yielding a schedule for the transformed instance with the claimed 
bound on the makespan. Finally, note that we can easily turn a schedule fulfilling the 
claimed bound on the makespan into a schedule for the original instance I satisfying 
the same bound on the makespan. □ 

Next, we take care of tiny classes that still might occur in a given instance. Again, 
without losing too much with respect to the optimal makespan we may assume a sim¬ 
plifying property as shown in the next lemma. 

Lemma 4.7. With an additive loss of at most AOPTbl/^ in the makespan we may 
assume the following properties: 

1. Each tiny class consists of a single job. 

2. In case that OPTbl/^ > s, it has size OPTbl/^ — s. 

Proof At first note that with an additive loss of at most 20PTbl /A in the makespan, 
we may assume that a tiny class is completely scheduled on one machine in a block- 
schedule. This is true because of reasons similarly used in the proof of the previous 
lemma: For each machine it holds that there are at most two different tiny classes of 
which some but not all jobs are processed on this machine. Hence, we may shift all 
jobs of such classes to one machine and thereby increase the makespan by at most 

20PTbl/\. 

Now distinguish two cases depending on whether OPTbl/^ > s or not. If this 
is the case, determine the length L of the sequence of all tiny classes (including setup 
times), round up L to an integer multiple of OPTbl/ \ remove all tiny classes from 
the instance and instead, introduce XL/OPTbl new classes each comprised of a single 
job with workload OPTbl/X — s. Observe that, given a block-schedule in which each 
tiny class is completely scheduled on one machine, we can simply replace tiny classes 
by these new classes, increasing the makespan by an additive of at most OPTbl/^- 
Also, this schedule implies a schedule for the instance in which tiny classes have not 
been grouped and its makespan is by an additive of at most OPTbl/X larger. This 
schedule is simply obtained by again replacing grouped tiny classes by its respective 
original classes. 

In case that OPTbl/^ < s, we simply group all jobs of a tiny class Ci to a new 
job j of the same size pj = w{Ci). Due to the fact that we might assume that a tiny 
class is completely scheduled on one machine, this proves the lemma. □ 


From now on, we assume that we have already conducted the grouping from the two 
previous lemmas and we describe how to round job sizes in order to reduce the search 
space for later optimization. The rounding approach is quite common for makespan 
scheduling. 

Given an instance /, we compute its rounded version I' by rounding up the size 
of each job to the next integer multiple of We know that there is a block- 

schedule with makespan at most OPTbl + and we also assume that the 

properties from Lemma 4.7 hold. 
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In case that OPTbl/^ > s each job has either a processing time of at least 
OPTbl/\ or forms a tiny class with workload at least OP'Pbl^x — s. On the other 
hand, in case that OPTbl/^ < s and there are tiny classes consisting of a single 
job, to execute such a job, we need perform a setup first which yields a processing 
time of at least opTbl/\ as well. Hence, we can have at most A + 8 jobs on one ma¬ 
chine in the considered block-schedule, leading to an additive rounding error of at most 
(A -f 8) • OPTBLjx^ in the makespan. Therefore, by choosing A appropriately, there is a 
solution to the rounded instance that approximates OPTbl up to any desired precision 
e > 0. 


4.3 Optimization over Block-Schedules 

We are ready to show how to compute a block-schedule for the rounded instance /' 
with makespan at most (1 + e)OPTBL for any £ > 0. The obtained schedule directly 
implies a schedule for the original instance I with the same bound on the makespan. 

Lemma 4.8. If all job sizes are a multiple and X > 0 is a constant, there 

is only a constant number Cd of different class-types. 

Proof We can represent any class Ci by a tuple of length A^ describing how many jobs 
of each size I ■ opTbl/\^, I < I < A^, occur in class Q. As each class has a size of at 
most 7 • OPT, each entry of the tuple is limited by yX'^ and there is at most a constant 
number Cd ■= {yX'^)^ of different tuples describing the classes of I'. In the following 
we say that all classes represented by the same such tuple are of the same class-type, 
proving the lemma. □ 


We can represent the classes that have to be scheduled as a tuple of size Cd where 
each entry contains the number of times classes of the respective class-type occur. 
Given a block-schedule S, we consider machine configurations that describe which 
classes are finished on the first i machines. We denote the sub-schedule induced by 
these first i machines by Si. 


Lemma 4.9. If all job sizes are a multiple of OPPblIx^ and X > 0 is a constant, 
the number of machine configurations representing Si for some block-schedule S and 
some i > 0 is bounded by a value Cconf that is polynomial in m. 


Proof. First, note that in a block-schedule S, for every Si, there is at most one class 
that is split due to the block-schedule property. Now, to uniquely define a candidate 
configuration, we need to store information about the classes that are finished, and 
in case a class has been split, the type of this class and which jobs of this class are 
finished. We reserve Cd entries for the finished classes, where each entry corresponds 
to the number of classes of the certain type that has been fully finished. Each entry 
is at most m • (A -I- 8) with similar arguments as in the proof of Lemma 4.8 and the 
reasoning concerning the maximum rounding error. For the class that has been split, 
we store the type of that class in an extra entry, which gives Cd possible values. If there 
is no class that has been split, we leave this entry empty adding another possible value 
to the entry. Finally, we store the number of jobs from the split class that have been 
finished for each job size as A^ additional entries, where each entry does not exceed 
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Cci • A similar to the structure in Lemma 4.8 Overall, we write a configuation as a tuple 
(ni,..., u\ 2 ) and thus there are at most Cconf ■= {m{X + 8))°'=' • {cd + 

1) • (cA)^ possible conhgurations, which proves the lemma. □ 


We now build a graph where we add a node for each machine conhguration. We 
draw a directed edge from node u to if and only if the machine conhguration corre¬ 
sponding to V can be reached from the conhguration u by using at most one additional 
machine with makespan not larger than (1 -f e)OPTBL- That is, assuming u is a pos¬ 
sible sub-schedule induced by the hrst i machines, we verify whether u is a possible 
sub-schedule induced by the hrst i -f 1 machines. We can do so as we assume that we 
have guessed OPT correctly and we can hence determine (1 + e) OPTbl which is 
the amount of workload we will ht on one machine. In order to determine the edges 
of the graph that describes our search space, we prove the following lemma, where 
we denote Is as the indicator variable which is 1 in case the boolean condition B is 
satished and 0 elsewise. Also, we dehne mpk to be the number of jobs of type k in 
class-type p, where k & A^} and p G Cd}- 

Lemma 4.10. If each configuration {ft, j,u) = (ni,..., nc^^ u\ 2 ) is repre¬ 

sented by a node, there is a directed edge from node V = (n, j, u) to V = {n, j, if 
and only if 


+ '^({uk-Uk)-k - 

r®+X! ^ ■ 

p—1 y y k—1 

< (l+e)-min|oPT + p„,,-l,^OPT|. (2) 

Proof We prove the statement for the following cases: 

1 - j 

First, note that the number of classes of type p G {1,..., Cci} that have been 
completed between node V and node V, i. e. on the additional machine, is ex¬ 
pressed in the value {ftp — Up). Now, in order to hnish all jobs from a class of 
type p G {1,..., Ccz}, we need to conhgure the machine for this class and after¬ 
ward, the workload of all jobs contained in that class type needs to be hnished. 
This leads to an overall processing time of s -f X]fe=i ^ for 

all jobs of the specihc class type. In case the class being hnished is j, there is 
still the same setup time, but there is less workload to be completed. This can be 
described by subtracting the amount of work already hnished in node V, which 
is '^k ' k ■ OPTBLfx^. Additionally, to reach the state represented by node 

V, class j needs to be set up and the workload depicted by u has to be completed 

^2 

yielding an additional processing time of s -f - opTblIx^, Summing 


OPTbl \\ 

V )) 
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all these times up, we get exactly the value on the left-hand side of inequality 

2 . j = j A 3i, > u^: 

In this case, we indeed have j = j, but as we have Ui > Ui for some i, there are 
more jobs of type i finished in V than in V. Thus, the class that had been partly 
executed at the end of V needs to be completed and the proof of case[T]similarly 
applies. 

3- j = j A Vi, Ui < Ui Au ^ u: 

Here, the scheduler does not necessarily need to finish the class that had been 
partly executed at the end of V. However, the overall necessary workload is the 
same whether the work on the current class is only continued and not finished 
(cost s for setting up the machine for the class) and a new class is fully executed 
(cost s) or whether it is finished (cost s) and a new class of the same type is 
initialized (cost s) and not finished. Thus, the proof of case[T]still applies. Note 
that this also holds if the number of classes of type j that have been fully finished 
is the same in V and V. 

4. j = j Au = u: 

In this case, we save an overall workload of s in comparison to the other cases. 
This is due to the fact that we do not need to perform a setup for class j as we 
can restrict ourselves to executing entire classes. 

Combining these cases completes the proof □ 

It is time to show that a schedule using only m machines and finishing all jobs 
exists. 


Lemma 4.11. We can construct a graph G such that there is a path from the node rep¬ 
resenting no job at all (source) to the node representing the entire instance I' (target) 
that has a length of at most m. 


Proof Using Theorem 4.5 there is a block-schedule with makespan at most OPTbl- 
Due to Lemma 4.6 and Lemma 4.7 together with the additive rounding error and a 
suitable value for A depending on e, there exists a solution to the rounded instance I' 
with makespan at most 


(1 + e)OPTBL = (1 + e) min{OPr + p^ax - 1, ^hOPT). 


By construction, the considered graph must contain a path describing this schedule, 
proving the lemma. Note that this naturally gives an approximation with factor at most 
(1 -f e) (OPT -f Pmax — 1) which is better in the case of Pmax < ^/20PT and which 
gives a PTAS for unit processing times. □ 


Theorem 4.12. By using breadth-first search on G, we can determine a schedule for 
the original instance I with makespan at most 


(1 -f e) min 


3 

2 


OPT, OPT -f Pmax 
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It implies an algorithm with exactly this approximation guarantee and runtime polyno¬ 
mial in n,k and m. 

Proof. Obviously, if we use breadth-first search on the graph, where the source ver¬ 
tex corresponds to the state where no job has been finished and the target vertex 
corresponds to the state where all jobs have been hnished, this gives a path p = 
(uo, Vi,... ,vi) of length at most m. By following this path and considering the dif¬ 
ference between two consecutive nodes pi-i and pi, we can efficiently determine the 
jobs from instance I' to be scheduled on machine Mi. The resulting schedule can be 
efficiently transformed back into the final schedule for instance / as already discussed 
during the description of the transformation we apply to I. Also, since the number of 
nodes is essentially the number of conhgurations, which in turn is polynomial in m, 
the search can be carried out efficiently. □ 


5 An Online Variant 

While in our original model discussed before we have assumed that all jobs are avail¬ 
able at time 0, also online variants are of fundamental interest. Consider a model in 
which a release time rj is associated with each job j and a job is not known to the 
scheduler before rj, i.e. jobs arrive in an online fashion. The objective remains the 
minimization of the makespan and we assess the quality of an online algorithm us¬ 
ing standard competitive analyses: An online algorithm is c-competitive if, for any 
instance, the makespan of the schedule computed by the online algorithm is by a factor 
of at most c larger than that of an optimal (offline) solution. 

A very simple lower bound on the competitiveness of any online algorithm can be 
obtained by exploiting the fact that any online algorithm cannot know the class of a 
job arriving later on in advance and hence, cannot perform a suitable setup operation 
beforehand. The following lemma shows that this fact results in a lower bound that can 
be arbitrary close to 2. 

Lemma 5.1. No online algorithm can be c-competitive for c < 2 — e and any e > 0. 

Proof. Consider an instance with (without loss of generality) m = 2 machines and the 
following adversary: At time 0 the adversary releases the first job of some class Ci 
with processing time pi = 1. Then, at time s a second job with processing time P 2 = 1 
is released belonging to a class for which the online algorithm has not performed a 
setup yet. Trivially, the optimal algorithm obtains a schedule with makespan s -f 1 by 
performing at time 0 a setup for the first job on one machine and one for the second job 
on the second machine, and then processes the two jobs until time s -t- 1. Any online 
algorithm cannot do better than performing a setup for the second job at time s and 
then processing this job. This directly implies a makespan of at least 2s -f 1. Hence, 
the competitiveness is at least which can be arbitrary close to 2 for large setup 

times s. □ 

Inim, Shmoys et al. present a quite general technique to turn an offline algorithm 
for a scheduling problem without release dates and an approximation factor of a into 
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a 2Q!-competitive online algorithm for the respective problems with release dates. Al¬ 
though this factor of 2 does not directly carry over to our scheduling problem since we 
also have to take into account setup processes, a slight modification yields the following 
result. 

Theorem 5.2. If each job is associated with a release time and jobs are revealed to 
the scheduler over time at these release times, our algorithm implies a polynomial time 
c-competitive online algorithm and c can be made arbitrarily close to 4. 

Proof. Although the proof is pretty much the same as that given in m, for the sake of 
completeness we state it again. Let 0 be the point in time where the first jobs arrive and 
call this set of jobs Sq. We apply our approximation algorithm and obtain a schedule for 
the jobs in Sq and let Fq be its makespan. Next we consider those jobs arriving between 
time 0 and Fq, call the set of them Si and compute a schedule for Si that begins at time 
Fq and ends at time Fi. Generally, we call the set of jobs released during the interval 
{Fi-i,Fi] the set Si+i where Fi is the point in time where the schedule for Si finishes. 
Then we schedule S'i+i using our approximation algorithm. 

Let Fi be the makespan of the entire schedule. We can determine an upper bound 
on Fi as follows; First, observe that Fl-l < Fi _2 + (1 + e){OPT + Prnax + s) 
since the approximation quality of our algorithm makes shure that we need at most 
(1 + e){OPT + Pmax + s) time to process the jobs in Si-i. Note that we may need 
the additional setup time s because the optimal schedule might have already performed 
necessary setups earlier. Second, consider the instance I' obtained from / by releasing 
the jobs of Si at time Fi_ 2 . We observe that Fi — Fi_i < (1 + e)(OPT + Pmax + s) — 
Fi -2 by the approximation quality of our algorithm and the fact that also the optimal 
solution cannot schedule jobs of Si before F;_ 2 . Putting both inequalities together we 
obtain Fi < 2(1 + e){OPT + Pmax + s) < 4(1 + e)OPT, proving the theorem. □ 

It remains an interesting question for future work, whether the gap between the 
lower and the upper bound can be narrowed by more clever lower bound constructions 
and/or a strategy specihcally tailored to the online scenario. 
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